Fix modelbuilder discrepancy on benchmarking #226

titaiwangms · 2025-09-19T21:37:29Z

ModelBuilder orders the past KV cache inputs differently from the PyTorch models, so the benchmark could not match its results against theirs. This PR fixes that with a fully annotated re-ordering function.
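
To illustrate the kind of re-ordering involved, here is a minimal sketch, not the function added in this PR. It assumes one side lays the cache out grouped (all keys, then all values) and the other interleaves key/value per layer. Which side uses which layout is an assumption here, and the helper names `grouped_to_interleaved` and `interleaved_to_grouped` are hypothetical.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def grouped_to_interleaved(flat: Sequence[T]) -> List[T]:
    """Turn [k0, k1, ..., v0, v1, ...] into [k0, v0, k1, v1, ...].

    Hypothetical helper: the two layouts are assumptions made for illustration.
    """
    if len(flat) % 2 != 0:
        raise ValueError("expected one key and one value per layer")
    n_layers = len(flat) // 2
    keys, values = flat[:n_layers], flat[n_layers:]
    out: List[T] = []
    for key, value in zip(keys, values):
        out.extend((key, value))
    return out


def interleaved_to_grouped(flat: Sequence[T]) -> List[T]:
    """Inverse of grouped_to_interleaved: [k0, v0, k1, v1, ...] -> keys then values."""
    if len(flat) % 2 != 0:
        raise ValueError("expected one key and one value per layer")
    # Even positions hold keys, odd positions hold values.
    return list(flat[0::2]) + list(flat[1::2])


grouped = ["k0", "k1", "v0", "v1"]
interleaved = grouped_to_interleaved(grouped)
```

The helpers work on any sequence, so the same permutation can be applied to input names, tensors, or expected outputs before comparing results.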

NOTE: I will open a follow-up PR to simplify and legitimize `get_inputs` so that it tests the real-world LLM cases: (1) prompt processing (sequence length > 1, without KV cache) and (2) token generation (sequence length == 1, with KV cache).
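
As a rough sketch of what those two cases look like in terms of input shapes (not the actual `get_inputs` implementation), the helper below builds a name-to-shape mapping. The function `make_input_shapes`, the input names, and the `(batch, heads, past_length, head_dim)` cache layout are assumptions for illustration. "Without KV cache" is modeled here as a cache of past length 0.

```python
from typing import Dict, Tuple


def make_input_shapes(
    batch: int,
    seq_len: int,
    past_len: int,
    n_layers: int,
    n_heads: int,
    head_dim: int,
) -> Dict[str, Tuple[int, ...]]:
    """Hypothetical helper returning input shapes for one forward pass."""
    shapes: Dict[str, Tuple[int, ...]] = {
        "input_ids": (batch, seq_len),
        # The mask covers cached tokens plus the new ones.
        "attention_mask": (batch, past_len + seq_len),
    }
    for i in range(n_layers):
        # Assumed naming and layout; real models may differ.
        shapes[f"past_key_values.{i}.key"] = (batch, n_heads, past_len, head_dim)
        shapes[f"past_key_values.{i}.value"] = (batch, n_heads, past_len, head_dim)
    return shapes


# (1) Prompt processing: several tokens at once, empty cache.
prompt = make_input_shapes(batch=1, seq_len=8, past_len=0,
                           n_layers=2, n_heads=4, head_dim=16)

# (2) Token generation: one new token, cache holds the 8 prompt tokens.
decode = make_input_shapes(batch=1, seq_len=1, past_len=8,
                           n_layers=2, n_heads=4, head_dim=16)
```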

fix modelbuilder discrepancy

1b1ac83

titaiwangms requested a review from xadupre September 19, 2025 21:37

loose abs

8a6797e

titaiwangms changed the title ~~Fix modelbuilder discrepancy~~ Fix modelbuilder discrepancy on benchmarking Sep 19, 2025

justinchuby approved these changes Sep 19, 2025

sdpython approved these changes Sep 20, 2025

sdpython merged commit 34ccaab into sdpython:main Sep 20, 2025
7 checks passed

3 participants
